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Abstract —This paper is concerned with distrihnted computa¬ 
tion of several commonly used centrality measures in complex 
networks. In particular, we propose deterministic algorithms, 
which converge in finite time, for the distributed computation of 
the degree, closeness and betweenness centrality measnres in di¬ 
rected graphs. Regarding eigenvector centrality, we consider the 
PageRank problem as its typical variant, and design distributed 
randomized algorithms to compute PageRank for both fixed and 
time-varying graphs. A key feature of the proposed algorithms 
is that they do not require to know the network size, which can 
be simultaneonsly estimated at every node, and that they are 
clock-free. To address the PageRank problem of time-varying 
graphs, we introduce the novel concept of persistent graph, which 
eliminates the effect of spamming nodes. Moreover, we prove that 
these algorithms converge almost surely and in the sense of U’. 
Finally, the effectiveness of the proposed algorithms is illustrated 
via extensive simnlations using a classical benchmark. 

Index Terms —Complex networks, centrality measures, dis¬ 
tributed computation, randomized algorithms, convergence prop¬ 
erties. 

I. Introduction 

Centrality measures refer to indicators which identify the 
importance of nodes in a complex network. As first developed 
in social networks, many of them were introduced to reflect 
their sociological origin |[T). Nowadays, they have become 
an important tool in network analysis, and are widely used 
for ranking the personal influence in a social network, the 
webpage popularity in the Internet, the fast spread of epidemic 
diseases, and the key infrastructure in urban networks. In fact, 
the ranking of a large number of objects is one of the most 
topical problems in information systems. Depending on the 
specific application, different centrality measures may be of 
interest. In this work, we are interested in the commonly used 
degree Q, closeness |[^, betweenness Q and eigenvector Q 
centralities in complex networks. 

As the network size becomes increasingly large, it is usually 
very difficult to compute centrality measures, except for the 
degree centrality which is of limited use. To address this issue, 
it is of great importance to design distributed algorithms with 
good scalability properties for their computation, where each 
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node evaluates centralities by only using local interactions. 
Although distributed algorithms may play a significant role 
in alleviating the computational burden, the access to limited 
information renders it challenging to ensure that each node 
provides its exact centrality. This requires a rigorous and 
challenging analysis regarding convergence properties of these 
algorithms, in particular for the PageRank computation. 

We recall that other communities, such as sociology, biol¬ 
ogy, physics, applied mathematics and computer science, see 
0, ©-d) and references therein, focused their attention 
on network centrality. In particular, within computer science, 
several authors studied the topic of distributed computing 
from the networking viewpoint 0. which however provides 
a different viewpoint than that presented in our paper. We 
also remark that the computation of the degree, closeness 
and betweenness centralities are closely related. For instance, 
calculating the betweenness and closeness centralities in a 
network requires calculating the shortest paths between all 
pairs of vertices. While the degree centrality is trivial, the 
computations of betweenness and eigenvector centralities have 
been extensively studied. Numerous algorithms have been 
designed to compute the betweenness centrality, including 
Floyd-Warshall algorithm 0 . Johnson’s algorithm 0 and 
Brandes’ algorithm 0 . On unweighted graphs with loops and 
multiple edges, calculating the betweenness centrality takes 
0(|V| • |£:|) time using the classical Brandes’ algorithm, where 
|V| and \S\ denote the number of nodes and edges of a graph, 
respectively. However, these algorithms are centralized and 
rely on global information of the network. 

Recently, distributed algorithms for computing the between¬ 
ness and closeness centralities in an undirected tree have been 
proposed in p8) , p9) via a dynamical systems approach and 
in | [20) , where a scalable algorithm for the computation of 
the closeness, based only on local interactions, is proposed. In 
particular, every node computes its own centrality under only 
local interactions with its neighbors. In this paper, we propose 
finite-time convergent algorithms to distributedly compute the 
closeness centrality of a directed graph and the betweenness 
centrality of an oriented tree from the perspective of partition¬ 
ing the network into multi-levels of neighbors. Our algorithms 
take advantage of the fact that a tree does not contain any loop, 
and therefore every pair of nodes has at most one shortest path. 

Another important measure of centrality is eigenvector cen¬ 
trality, which is defined as the principal eigenvector of an adja¬ 
cency matrix of the graph. It measures the influence of a node 
by exploiting the idea that connections to high-scoring nodes 
are more influential. That is, “an important node is connected 
to important neighbors.” In various applications, the notion 
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of eigenvector centrality has been modified for networks that 
are not strongly connected, for example in systems biology 
0, Eigenfactor computation in bibliometrics | [22) and Web 
ranking In this case, a key idea is to introduce a so- 
called “teleportation factor” which includes a free parameter 
generally set to 0.15, see p4) , p5) for further details. The 
resulting modified network then becomes strongly connected. 
In particular, for Web ranking, this modification leads to the 
well-known PageRank ig, which has attracted significant in¬ 
terest from the systems and control community |241, @,1^. 
Our interest regarding the eigenvector centrality is particularly 
focused on the distributed computation of PageRank. 

Currently, the number of webpages in the Internet is incred¬ 
ibly large, and it is not even exactly known. This observation 
raises two interesting questions; (1) How to distributedly 
estimate the network size? Under local interactions, this 
problem is nontrivial as we cannot ensure to count every 
page, and some page(s) might be counted more than once. 
(2) Without knowing the network size, how to compute the 
PageRank by only using local interactions? To the best of 
our knowledge, these problems remain widely open. If the 
network is time-invariant, its size is known and a global clock 
is available, several distributed randomized algorithms have 
been proposed in p8) for calculating the PageRank. The idea 
lies in the design of the so-called distributed link matrices 
by exploiting sparsity of the hyperlink structure. Then, each 
page is randomly selected to update its importance value by 
interacting with those connected by hyperlinks. However, the 
randomization is based on an independent and identically 
distributed process (i.i.d.) and should be known to every page. 
The need for this global information is indeed critical and 
the i.i.d. assumption is certainly not mild. Furthermore, the 
algorithms are based on a time-averaging operation, which in¬ 
evitably slows down convergence. Other distributed PageRank 
algorithms are provided in p4) , |27|, p9) , pO) and references 
therein. 

To suitably address the above questions, in this paper, we 
first reformulate the PageRank problem from the least squares 
(LS) point of view, and then propose a randomized algo¬ 
rithm to incrementally compute the PageRank. Specifically, 
we consider a Web random surfer exploring the Internet. 
When browsing a webpage, the surfer incrementally updates 
an estimate of the PageRank by using importance values of the 
pages that have outgoing hyperlinks to the current page. Then, 
he/she will either randomly select an outgoing hyperlink of the 
current page and move to the page pointed by this link, or jump 
to an arbitrary page of the Internet, after which the PageRank 
estimate is updated again. This process results in a new type 
of the celebrated Kaczmarz algorithm ED with Markovian 
randomization, instead of a simpler i.i.d. randomization which 
may not adequately represent a Web surfer model. In this case, 
the proof of convergence (both almost surely and in the 
sense) requires to exploit deep properties of Markov processes 
in the theory of stochastically time-varying systems | [^ . 

Remarkably, the proposed randomized algorithms can be 
conveniently implemented in a fully distributed manner, and 
each node simply maintains an estimate of the importance 
values of its neighbors and itself. In addition, every node only 


requires to know the number of neighboring nodes, rather 
than the total number of nodes in the network. Interestingly, 
the network size is simultaneously estimated with probability 
one by an individual node. We point out that, even if the 
PageRank computation is reformulated as a least squares 
problem, existing distributed optimization algorithms pO) , 
P^-PD are not directly applicable because they require the 
knowledge of the network size. 

For the case of temporal networks described by time-varying 
graphs, where hyperlinks vary over time, but the network 
size is assumed to be constant, an interesting problem is 
how to define the PageRank to measure the importance of 
each node. To the best of our knowledge, this problem has 
never been studied in the systems and control community. 
Therefore, a persistent graph is introduced to eliminate the 
effect of transient hyperlinks. Intuitively, PageRank should not 
be affected by spamming links, and the importance value of a 
spamming page should be negligible. Our approach is indeed 
very useful to deal with spamming nodes which create a large 
volume of hyperlinks in a short period. A persistent graph adds 
large weights on persistent hyperlinks, and a larger weight 
on more recent hyperlinks. Then, the proposed incremental 
algorithm is generalized to address temporal networks with 
time-varying links and its convergence properties are also 
rigorously established. 

In summary, the main contributions of this work are at least 
threefold. First, distributed algorithms with finite-time conver¬ 
gence are derived to compute the closeness and betweenness 
centralities in a unified setting. Second, we reformulate the 
PageRank problem as a FS problem and provide a new type of 
Markovian Kaczmarz algorithm, with rigorous convergence, to 
compute the PageRank. The algorithm can also be distribut¬ 
edly implemented even with unknown network size. Third, 
a novel concept of persistent graph is adopted to effectively 
study the PageRank problem over time-varying networks. 


The rest of the paper is organized as follows. In Section 
[n] we provide an overview of centrality measures and the 
PageRank problem in both fixed and time-varying graphs. 
In Section m we design deterministic algorithms to in¬ 
crementally compute the degree, closeness and betweenness 
centrality measures. In Section IV the PageRank problem 
is reformulated as a least squares problem, based on which 
incremental algorithms are introduced to distributedly compute 
the PageRank. In Section the incremental algorithms are 
randomized by mimicking the behavior of a random surfer. 
We also prove convergence of the randomized incremental 
algorithms to the PageRank. The case of temporal networks 
with time-varying links and related convergence properties are 
studied in |Vll Simulation results for a classical benchmark 
are included in |VII| Some concluding remarks are drawn in 
Section IVm] 


Notation: For any vector x G K" and p > 0 , let ||a:||p = 
(Er=i If p = 2, we simply write ||a;|| = ||a:||2, 

which denotes the Euclidean norm of a vector. If A is a 
matrix, we use ||A||p to denote the matrix norm induced by 
the vector norm || • ||p, i.e., ||A||p = sup||2,|| ||Aa:||p. The 

norm ||A||lp is defined by ||A||lp = (E||A||p^/P, where E 
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denotes expectation. The symbol 1 represents a vector with 
all elements equal to one. For a symmetric matrix A, the 
notation A > 0 (A > 0) means that A is positive semi-dehnite 
(dehnite), and the relation A> B means that A — B > 0. 

II. Centrality Measures and PageRank 

Ranking nodes is a crucial question in network science and 
has attracted a lot of attention in several scientihc commu¬ 
nities. A key problem is how to discern the importance of 
each node, which can be formalized by dehning a centrality 
measure. In this section, we review some basics of complex 
networks and their centrality measures before concentrating 
on their distributed computation by means of deterministic 
and randomized algorithms. 

Given a graph Q = {V,£}, where V := A^} is 

the set of vertices (nodes) and £ is the set of edges, let 
A = denote its adjacency matrix, i.e. = 1 if 

vertex i is linked to vertex j and 0, otherwise. That is, an 
ordered pair {i,j) G £ if and only if = 1. A forward 
path in 0 is a sequence of nodes (ni ,... ,nk) with k > 2 
and a corresponding sequence of edges {ni,ni+i) G £. The 
distance of a path is defined as the sum of atj on the path. Q 
is strongly connected if any pair of vertices are reachable from 
each other via a forward path. Let be the set of nodes that 
are reachable from node i via exactly k directed edges, and 
let be the set of nodes that are linked to node i via exactly 
k directed edges. Moreover, self-loops are not allowed, i.e., 
flii = 0 for all i S V and Vi = V \ {z}, which is the set of 
nodes excluding node i. 


A. Centrality Measures 

Depending on the specific application, different notions of 
centrality may be of interest. The most intuitive notion is 
arguably the degree of nodes, that is the number of neighbors 
of each node. The out-degree of node i is defined by 

Di = ^ ^ dij • ( 1 ) 

fev 


Due to its local nature, this centrality measure is of limited 
use, and may fail to capture the actual role of the node 
in the network. Hence, more refined definitions have been 
proposed. Classical measures include closeness, betweenness, 
and eigenvector centralities. 

In strongly connectecQ networks, a node is considered more 
important if it is closer to other nodes in the network. While 
fairness of a node is the sum of its shortest distances to all 
other nodes, closeness was defined in Q as the reciprocal of 
the fairness, i.e.. 


a = 




( 2 ) 


where d{i,j) denotes the shortest distance of the forward path 
from node i to node j. 


’When a graph is not strongly connected, a widespread idea is to use 
the sum of reciprocal of distances, instead of the reciprocal of the sum of 
distances, with the convention l/oo = 0. 


The betweenness Q quantifies the “control” of a node on 
the communication between other nodes, and ranks a node 
higher if it belongs to more shortest paths between other two 
nodes in the network. Formally, the betweenness of node i is 
dehned by 




E 

3,k&Vi 




(3) 


where a{j, k) denotes the number of shortest paths from node 
j to k, and a{j, k, i) denotes the number of shortest paths from 
node j to k containing the node i. 

Eigenvector centrality measures the influence of a node in 
a network by the entries of the principal eigenvector of the 
adjacency matrix, i.e., the score of eigenvector centrality of 
vertex i is dehned as 

1 

dijXj (4) 

^ j&v 

where A is the leading eigenvalue of A Q. If A is primitive, it 
follows from Perron-Frobenius theorem that all the elements 
of X are positive p8| . 

The well-known PageRank is a special case of this centrality 
measure when the adjacency matrix is suitably redehned as a 
column stochastic matrix, often called the hyperlink matrix. 
This notion is discussed in detail in the next subsection. 


B. The PageRank Problem 

The PageRank of the Internet is a typical application of 
the eigenvector centrality for ranking webpages, and has 
recently attracted the attention of the systems and control 
community p8) . In this subsection, we adopt this formulation 
for the PageRank problem of static graphs, and extend it to 
temporal networks of time-varying graphs, which is based on 
the concept of persistent graph. 

1) The PageRank Problem in Static Graphs: The basic idea 
in ranking pages in terms of the eigenvector centrality is that 
a page having links from important pages is also important 
p8| . This is realized by determining the importance value 
(eigenvector centrality of a weighted adjacency matrix) of a 
page as the sum of the contributions from all pages that have 
links to it. In particular, the importance value Xi G [0,1] of 
page i is dehned as 



It is customary to normalize the importance values so that 
view of (|^, the hyperlink matrix W := 
{wij) G is dehned by 


Wij 


1/Dj ifjGCl, 
0 otherwise. 


(6) 


X = Wx, l^x = 1, and x G [0,1]^. (7) 

If Q is not strongly connected, then W may be reducible 
and X is not unique. To get around this problem, we consider 
a random surfer model p^ . Particularly, a surfer may follow 
the hyperlink structure of the Web or randomly jumps to 
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Fig. 1. A directed graph with six nodes. 


other pages with equal probability (which is denoted 


“teleportation” in |28|). To accommodate this behavior, we 
adopt a modified hyperlink matrix M G which is a 

convex combination of two column stochastic matrices, i.e.. 


m. 


M := (1 -m)W + —11^ 


( 8 ) 


where m is a parameter such that m S (0,1), and denotes 
the probability of restarting the random surfer at a given step. 
This assures that; (a) any node is reachable from all the other 
nodes, so that the graph is strongly connected; and (b) the 
Markov chain generated by M is aperiodic and irreducible 
p9] . Usually, the value m = 0.15 is chosen at Google | |2^ . 

In line of p4) , equation 0 with W replaced by M is 
referred to as the PageRank equation. Note that M is a 
column stochastic matrix with all positive entries. By Perron 
Theorem p8) , one is the leading eigenvalue of M, and has 
algebraic multiplicity equal to one. Moreover, the principal 
eigenvector of M is unique (within a multiplier), and has all 
positive elements. This implies that the PageRank equation is 
written as 

X = Mx and l^a; = 1, (9) 


and admits a unique solution x G [0,1]^. Clearly, Xi also 
dehnes the eigenvector centrality of node i in the Internet. 

We use the following example to illustrate the above cen¬ 
trality measures. 

Example 1. Consider a directed graph with six nodes in 
Fig. 0 The adjacency matrix and hyperlink matrix are given 
by 
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For comparison, the degree, closeness and betweenness 
centrality measures are normalized and the normalized version 
is provided in Table from which we may conclude that the 
nodes 3 and 4 are two most important nodes, while nodes 1 
and 5 are the two least important ones. 

2 ) The PageRank Problem for Temporal Networks: Obvi¬ 
ously, the network of webpages is time-varying due to the 
creation or deletion of hyperlinks. In this paper, we consider 
the case where hyperlinks between webpages are time-varying 


TABLE I 

Centrality measures for the graph in Fig.[T] 


centrality 

I 

2 

3 

4 

5 

6 

Degree 

.1667 

.1667 

.2500 

.1667 

0.0833 

.1667 

Closeness 

.1708 

.1708 

.2196 

.1708 

.1281 

.1398 

Betweenness 

.0217 

.1957 

.1957 

.4130 

0 

.1739 

PageRank 

.0727 

.1122 

.1986 

.2963 

.1131 

.2072 


and the number of nodes is constant, i.e., Q{k) = {V,£(fc)}, 
where £{k) denotes the set of hyperlinks at time k. Then, the 
hyperlink matrix is no longer fixed and varies over time. An 
interesting problem is how to define the PageRank in this case. 

Intuitively, a spamming page is a webpage that has too 
many outgoing hyperlinks in a short amount of time. From 
this perspective, the spamming pages have only a signihcant, 
but short and negative effect on the network. However, a 
reasonable PageRank definition should not be much affected 
by spamming pages, and the importance value of a spam¬ 
ming page should be as small as possible. To formalize this 
observation we introduce a persistent graph where the effect 
of spamming (transient) links will be eventually excluded, and 
only the persistent links may significantly affect the PageRank 
value. Specihcally, we define a persistent hyperlink matrix as 


W = 


lim -- 

k—foo 1 “h • • • “h 


^ t=l 


( 10 ) 


where g G (0,1] is a forgetting factor and provides larger 
weights on the more recent hyperlink matrice^ Then, the 
modified link matrix is given by 

M = (l-m)W+ (11) 

Clearly, this definition includes the case of static graphs, and 
does not require the stationarity of the network. Based on 
previous discussions, M is a column stochastic matrix with 
positive entries, and the PageRank equation is expressed as 

X = Mx and l^x = 1. (12) 


III. Distributed Computation oe Degree, 
Closeness and Betweenness Centralities 

In this section, we focus on the distributed computation 
of the degree, closeness and between centralities in the net¬ 
work Q = {V,£). Obviously, the computation of the degree 
centrality is trivial. The key technique for the closeness and 
betweenness centrality measures lies on the computation of 
the shortest distance for each pair of nodes in the graph. 
This problem is solvable in a centralized manner by a linear 
program Here we propose a distributed algorithm to 

incrementally compute the shortest path between two nodes, 
which is then used to compute the closeness centrality of a 
directed graph and the betweenness centrality (the latter in 
the special case of an oriented tree). 

^Here we implicitly assume that the limit exists. 
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A. Degree and Closeness Computation 

Let c?max = maxj ,gv which is also obtainable in a 

distributed manner pi) . Then, 7?,* = 0 for all t > dmax and 

d(Lj) =f,Vj G7^‘ (13) 

and 

C^max 

{jTllC V,. (14) 

t=i 

If Q is strongly connected, the equality holds. Otherwise, 
there exists a node that can not be reached from node i and 
the left hand side of ( [T4| ) is a strict proper subset of V^. It 
should be noted that TZ^ can be empty, and TZi n TZ^ = 0 if 
k ^ j. By @ and ( [l4] |, the computation of d{i,j),j G Vi is 
the same as the task of partitioning the set Vi. By the local 
(one-hop) interaction, the {t + l)-hop neighbor set of node i 
is recursively given by 

t 

7^‘+l = IJ - U (15) 

J&Tll 

That is, the one-hop neighbors of node i send their <-hop 
neighbors to node i, and node i checks whether they belong 
to its fc-hop neighbors, fc < f. If not, then this node has a 
minimum distance f -|- 1 to node i. Obviously, this algorithm 
relies only on the local interaction with one-hop neighbors and 
it is provided at the end of this section. 

B. Betweenness Computation of Trees 

The betweenness centrality is essential in the analysis of 
social networks, but costly to compute for a generic graph. 
A space and time efficient centralized algorithm has been 
proposed in GD- In this section, we propose a distributed 
method to compute the betweenness centrality measure for an 
oriented tree, which is a directed acyclic graph p2) . 

Note that a tree does not contain any cycle, and has the key 
feature that the number of shortest paths between two nodes 
is always equal to one. This implies that 

Bi= ^ (16) 

jVeVi 

To compute Bi, define the set of all reachable nodes from 
node i by 

f^max 

= U (17) 

t=i 

We also denote the set of reachable nodes from node i via 
the directed link {i,j) G £ by TZi^j. By convention, if there 
is no edge from node i to j, we set TZi^j = 0. Since Q is 
an oriented tree, then for any k f j, TZi^j n TZi^k = 0- To 
elaborate it, suppose that there exists a node v G TZi^jCdZi^k- 
Then, there exist two directed paths from node i to node v, 
respectively, via node j and node k. If we replace directed 
edges with undirected ones, we obtain a cycle starting from 
node i via node v to node i. This is in contradiction to the 
definition of an oriented tree. Hence, it is clear that 

= U - 

i6V 


Similarly, define the set of all nodes linking to node i by 
Li and the set of nodes linking to node i via the directed link 
(j, i) G £ by Lj^i, which again satisfies that 

~ U 

iev 

Now, we show how to compute the betweenness of node 
i. If there exists u and v such that a{u,v,i) = 1, we can 
always find a pair of nodes j and k f j such that u G Lj^i 
and V G TZi^k- Conversely, for any pair of nodes u G Lj^i 
and V G TZi^k, we obtain that a-{u,v,i) = 1. Otherwise, it 
contradicts the definition of an oriented tree. This implies that 
betweenness of node i can be explicitly computed by 

B,= (18) 


In summary, the distributed computation of the degree, 
closeness and betweenness centrality measures are given in 
Algorithm [T] We remark that the closeness and betweenness 
centralities are computed for an undirected tree from a dy¬ 
namical system point of view in 118|, 1191. 


Algorithm 1: Distributed computation for degree, close- 
ness and betweenness centralities_ 

• Initialization; for every i G V, compute TZ\ and C\ by 
local interactions; 

• For t < dmax- Given T?.* and £j, which are obtained 
from local interactions with j G TZ\ and j G C\, 
respectively. Compute 

• Compute the cardinality of TZi^j and Lj^i by 

^max ^max 

^ |7^‘| and ^ \L]\. 

t=i t=i 

Then, output the degree, closeness and betweenness 
centralities by 

A = \nl\; 

^max 

Bi = 'y ^ |72.i_>.j| • |£fe_>.i|. 

jenfkecfji^k 


Remark 1 (Computation Complexity). In Algorithm the 
number of operations in each node is 0(|V| • |7?.^axl) 
every t where |7^maxl = max^gy |- As t < dmax, the 
total complexity of a single node is 0{\V\ ■ |7^maxl ’ Aax). 
which is smaller than the centralized Brandes’ algorithm 
0(1 V| ■ |£|). However, the total number of operations in the 
whole network is 0(| Vp • \TZ]^^f\ ■ dmax), which is larger than 
Brandes ’ algorithm due to the limited access of information 
in computation. 
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IV. Distributed Computation of PageRank 

In this section, we provide incremental algorithms to dis- 
tributedly compute the PageRank, which is also a special case 
of eigenvector centrality, under two scenarios depending on 
whether an individual node has the knowledge of the network 
size. We reformulate the PageRank computation as a least 
squares problem, and propose a new type of randomized 
Kaczmarz algorithm to incrementally compute the PageRank. 
The essential idea lies in the integration of the randomized 
incremental algorithm with a random surfer model. The strik¬ 
ing features of our algorithm are at least threefold; (a) It can 
be conveniently implemented in a fully distributed manner by 
using only local information of an individual page, (b) It is 
based on Markovian randomization (instead of simpler i.i.d. 
randomization), which accounts for the Web structure quite 
well, (c) It can be simply generalized to accommodate tem¬ 


{I - (1 - m)W)x = —1. 


(19) 


N 


^(1-to)*1V*1. 


2 = 0 


By it is clear that l^W = 1^. This implies that 

^ CXD 




m 

2=0 


iyvwn = 


m 


(l-my = 1. 


2=0 


m 


X = argmin^gR«||(J- (I -m)lV)a;- ^if. 


( 20 ) 


yi = m/N. The optimization problem (20i is easily rewritten 
as a LS problem 


N 


= argmin^gR„ 


( 21 ) 


Note from (|6]l that Hi is computable by using only information 
from the neighbors with outgoing links to node i, which is 
obviously known to node i. Usually, H is a sparse matrix, 
and contains many zero entries. 

Similarly, the LS reformulation of the PageRank problem 
for temporal networks is given as 


N 


x^" = argmin^gjj« 


( 22 ) 


poral networks, as discussed in Section VI Further comments 
and connections with the Kaczmarz algorithm are given in 
Remark |7] in Section V. 

A. Least Squares Reformulation of the PageRank Problem 

Now, we reformulate the problem of solving the PageRank 
equation Q as a least squares problem by using the following 
result. 

Lemma 1 (Equivalent Equation for PageRank). The solution 
to 0 is equivalent to that of the following equation 


where Hi = Ci — {I — m)Wi and Wi is the i-th row of W. 
Next, we state a result which guarantees the solvability and 


uniqueness of the LS problems (20 1 and 


Lemma 2. For the PageRank computation, Hi in 

(HZ]) is positive definite. 

Proof: Note that I — {1 — m)W is non-singular. The 
positive dehniteness of Hf Hi follows from 

N 

'^Hj'Hi = {I - - {l-m)W) > 0. 

i=l 

Thus, the proof is completed. ■ 

By using standard result on LS techniques | [43) , it is obvious 
that the solution to 0 is exactly expressed as 


Proof: By = 1, then Mx = (1 — m)Wx = x, 
which implies 0. Conversely, it follows from that W is a 
column stochastic matrix. This implies that the eigenvalue with 
the largest magnitude of W is one. Since m strictly belongs 
to (0,1), we obtain that / — (1 — m)W is nonsingular, and its 
spectral radius is strictly less than one. Then, it follows from 
( [T^ that 

X = — 

m 


N 


N 


= {Y.HlHy- 


(23) 


That is, the solution to ( [T9| l also solves 0- ■ 

The advantage of the PageRank equation in (19i over (j^ 
is to drop the normalization constraint = 1, which 

implies that the PageRank can be solved via an unconstrained 
optimization as follows 


Obviously, the same arguments continue to hold for the 
time-varying graphs by directly substituting H with H. 

B. Incremental Algorithms with Known Network Size 

To obtain the LS solution, the formula in ( |2^ requires 
to utilize all yi and Hi for computing the inverse of a 
square matrix of order N, which is in fact the network size. 
Even worse, the sparsity of Hi is not used as the matrix 
Hi does not preserve a sparsity structure. Due to 
the size of the network, the computational cost of inverting 
the matrix in is very large. This motivates to design 
incremental algorithms to compute the PageRank x and x. 

Now we propose a randomized incremental algorithm with 
a diffusion vector x{k) G (shown in equation (24i below) 


to compute the PageRank. At iteration k, a node, indexed 
as s{k) G V, is randomly selected according to the method 
described in Section V-A. This page incrementally updates 
x{k) by performing a fusion algorithm 

1 d{ys(k)-Hs(k)xy ^ 

x(k if — x(k) \x=x{k) 


Then, the key problem is how to efficiently compute a; in a 
distributed manner. Let Wi be the i-th row of W, Hi = Ci — 
{l — m)Wi, where Ci is the i-th row of an identity matrix, and 


2N 

1 


= x{k) + - Hs(k)x{k)), (24) 


and the initial condition x(0) = 0. 
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In comparison with the traditional gradient algorithms 
the above algorithm only guarantees that a component 
iVsik) - of the total cost “ Hix{k)Y 

is improved. Since it does not take other components into 
consideration, the random process s(fc) should be carefully 
designed to reduce the total cost. Thus, a deeper investigation, 
performed in Section V, is required to prove that the iteration 
in p4|) solves the LS problem pT). 


Remark 2 (Computation Complexity). Due to the sparsity 
of the number of computations in (25i is small, e.g., 

the number of multiplications is only twice of the in-degree of 
node s{k) and scalable to the network size. Thus, it can be 
easily performed by a cheap processor. 


C. Distributed Implementation with Unknown Network Size 

Generally, the network size N, which is a global parameter, 
is unknown to an individual node. In this case, there are two 
critical issues for running the fusion algorithm p4| . The first 
issue is that both the stepsize 1/iV and ys(^k) = m/N are not 
available. To resolve this problem, we modify the incremental 
algorithm as 

x{k + 1) = x{k) + a{k) ■ (25) 

where ys(k) = • a{k), and the stepsize a{k) is given by 

/c 

^ t=o 

with the standard indicator function Xs(k){') being defined as 
Xs{k){x) = 1 if a; = s{k) and 0 otherwise. 

It is easy to check that a{k) counts the frequency of 
randomization of page s{k). Under relatively mild conditions, 
in Section V we show that a{k)~^ converges to N with 
probability one as the number of updates k tends to infinity. 
Roughly speaking, this implies that the measurement estimate 
ys(k) asymptotically converges to ys(k) = m/N. 

Remark 3 (Clock Free Algorithm). The total number of 
activations fc + 1 for a{k) can be locally obtained by the 
activated node via a local counter. In particular, the counter 
increases by one at each activation, after which it is passed to 
(detected by) the next activated node. Under this mechanism, 
the activated node is able to access the total number of 
activations, and does not require a clock in This is 

contrary to the randomized algorithms proposed in which 
indeed requires a global clock. 


The second critical issue regarding unknown network size 
is that the dimension of x(k) is unknown to an individual 
node. This implies that the iteration ( p5] ) can not be executed 
locally either. However, this is not a problem as only a 
coordinate block of x(k) in (251 is updated per iteration, 
and does not require any node to know the network size. To 
implement (25 i in a distributed manner, each page is in charge 
of computing a “sub-PageRank” for its neighboring pages and 
itself. Specifically, let x{k) = [xi(k),... ,XN{k)Y' and 


a;W(fc) = [x,(k),Xj(k),j € C-]^, 



Fig. 2. Local PageRank computation. If node n is initiated at iteration k, 
which is detected by its neighbors, eveiy neighbor sends its importance value 
Xj(k)^j E to this node for a local computation. Node n assembles their 
values and performs the iteration ig. The updated values are again detected 
by the neighbors. 


where x^''\k) consists of all the estimated importance values 
of the neighbors of node i and itself. By sequentially sorting 
out all nonzero elements of Hi and collecting them into a new 
vector Hi G whose dimension is much smaller than 


N for a sparse network, it follows from (251 that 




a;—■>(/(;+1) = -h 


(27) 


a(^) • H,(^k)<ais{k) - Hs(k)X 

This iteration is fully localized, see Fig. for illustration. In 
particular, the stepsize a{k) counts the percentage of updates 
that have been completed in page s{k), which is inherently 
known to this page without any global information. Similarly, 
i7s(fe) is solely decided by the incoming links to page s{k), 
which is again known to this page. It is also consistent with the 
observation that every page is only concerned with the rank of 
neighboring pages, and returns a sub-PageRank. Subsequently, 
each neighboring page detects its updated value in the sub- 
PageRank from page s(k). 

In summary, the fusion algorithm ( p7] ) can also be imple¬ 
mented in a fully distributed way for networks with unknown 
size. Thus, the remaining problem is to show the convergence 
of x(k) in (25 1 or ( p7| to the PageRank x. This will be 
addressed in Section V. Algorithm provides the distributed 
computation of PageRank with unknown network size. 


Remark 4 (Unknown Network Size). It is noteworthy to 
remark that the existing distributed optimization algorithms 
cannot be directly applied here because they require 
to know the network size. 


Algorithm 2: Distributed computation of PageRank with 
unknown network size _ 

• Initialization: for every i gV, set Xi(0) = 0 and 
s(0) = 0; 

• If s{k) = i, node j G C\ sends its importance value 
Xj{k) to this node for a local computation as in (271. 


Node j G £/ U {t} updates its importance value from 
Xj(k) to Xj(k + 1); 

Repeat. 












V. Randomization and Convergence oe the 
PagerANK eor Static Graphs 


In this section, we analyze randomized algorithms, which 
may be superior to deterministic algorithms in many aspects 
| |45] , for the PageRank computation of static graphs with both 
known and unknown network size. Then, we provide conver¬ 
gence results almost surely and in the sense of L^{\ < p < 2). 
The diffusion algorithm studied here is based on a specihc 
construction of the random process s{k) in (25 1 which reduces 
the total cost in (21 1 , and drives the iteration to converge to 
the LS solution. 


A. Randomization of s{k) 

First, motivated by the random surfer model described 
in Section II-B we design the random process s{k) which 
dictates how the nodes of the Web are selected. At time k, a 
random surfer has a prior vector x{k), and randomly browses 
a page, indexed as s{k). After inspecting the incoming links 
of this page and the number of visits to this page (to update 
the stepsize a{k)), the surfer incrementally updates this vector 
by (24 1 to x{k + 1). If s(fc) = i, the surfer randomly jumps to 
page j (j i) either by following the hyperlink structure with 
probability (1 — lS)P^, where uj G [0,1] is a parameter and 
is a probability that is defined based on the Web structure, 
or performs a random jump to page j with probability oj/JV. 
Here j = i means that the surfer refreshes the current page 
with probability {l—u})P^ or randomly returns to the current 
page with probability uj/N. 

Motivated by this discussion, we are now ready to define 
the Markov chain, and its transition probability matrix, of the 
random process s{k) in (25 i. 


Definition 1 (Transition Probability Matrix for Static Graphs). 
The increjnental process s{k) is a Markov chain with a 
transition probability matrix P = [1 —uj)P^ + where 

UJ G [0,1], and P^ is given by 


pW 


if{i,j)G£, 

1 ~ J2(i,k)es ifi= j) 

0 otherwise. 


The matrix P^ (generally called the Metropolis-Hastings 
matrix) is doubly stochastic, and it coincides a so-called min- 
equal neighbor scheme p6) . We notice that the second term 
11^/A^ in the transition probability matrix is motivated by 
the teleportation model described in Section |II-B| The convex 
combination parameter w G [0,1] represents two modes of 
randomization, which are studied in a unified setting; (a) 
centralized randomization of •UJ where w can be selected 
as any value in (0,1]. The extreme case w = 1 means that 
the random process s(k) reduces to an i.i.d. process as in 
the randomized Kaczmarz algorithm 0. Usually, a larger 
value of UJ implies better convergence performance of ( [24| . 
(b) distributed randomization of ( |27] i in Section |IV-C| where 
randomization is allowed to pass from one page only to its 
neighbors and the network size is unknown. 

The case (b) corresponds to w = 0 in the transition 
probability matrix P and is applicable only to a strongly 


connected Web. Suppose the Web is not strongly connected, 
there exists a node i that can not be reached from some other 
node j. Then, once node j is activated, node i can never be 
activated again. Even though the Web is generally not strongly 
connected, strong connectivity may be enforced in practice, 
as discussed in p^ . In particular, if the network contains 
dangling pages and is not strongly connected, the surfer may 
use a back button and return to the previously visited page to 
continue randomization. In the centralized randomization case, 
the second term of P requires that every pair of pages should 
be reachable from each other. This is a natural assumption, 
which is now formally stated. 

Assumption 1 (Strong connectivity). For the distributed ran¬ 
domization, i.e., u! = 0 in the transition probability matrix 
P = (1 — uj)P^ 11^ = P^, we assume that the network 
of webpages is strongly connected. 

A nice property, which follows from Definition [T] of transi¬ 
tion probability matrix and Assumption [^on strong connectiv¬ 
ity is that s{k) admits a unique stationary distribution, which 
is uniformly distributed over the set V. This implies that 

1 ^ 

^ = ^ 28 ) 
^ ^ i=l 

1 ^ 

i=l 

where the expectation operator E[-] is taken with respect to 
the random process s{k). 

B. Convergence Analysis 

The first convergence result we state for the randomized 
incremental algorithm is for known network size. 

Theorem 1 (Convergence with Known Network Size). Under 
Assumption on strong connectivity, the randomized incre¬ 
mental algorithm \24\ with transition probability matrix given 
in Definition enjoys the following properties: 

(a) There exist ko > 0, Mi > 0, and pi G (0,1) such that 
||a;(fc)—a;|| < Mi-p\ with probability one for all k > fco- 

(b) Given any p G [1, 2], there exist M 2 > 0 and p 2 G (0,1) 
such that ||a;(fc) — x IIiP < AI 2 • p\ for all k > 0. 

Remark 5 (Convergence with Known Network Size). If 
the network size is known, the randomized algorithm ( |24p 
exponentially converges to the PageRank almost surely and 
in the sense of L^il < p < 2). From this point of view, 
this convergence result is much stronger than those stated for 
the PageRank algorithms in Ili28^ , where mean square error 
convergence is stated with linear convergence rate 0{\/k), 
see further discussions in Section V-C. 

Remark 6 (Relations with Randomized Algorithms for Dis¬ 
tributed Optimization). The idea of using randomized incre¬ 
mental algorithms has been adopted in fflV'j , for solving 
distributed optimization problems. However, the convergence 
in these papers requires the stepsizes decreasing to zero, which 
inevitably reduces the convergence rate. In addition, the proof 
of convergence is completely different from that of Theorem^ 
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Remark 7 (Comparisons with the Kaczmarz Algorithm). 
There are several striking differences between the randomized 
diffusion algorithm studied in this section and the cel¬ 
ebrated randomized Kaczmarz algorithm in m which has 
been originally developed to solve generic linear equations. 

First, the random process s{k) is Markovian and takes into 
account the surfer’s browsing history. This case covers the 
i.i.d. process (which is assumed in the randomized Kaczmarz 
algorithm) as a very special case, i.e. in Definition oj is 
equal to one in the transition probability matrix P. Notice 
that Markovian randomization can be easily implemented by 
only allowing nodes to communicate with their neighbors. 
Therefore, it is a very general randomization scheme which 
is suitable to describe a random surfer model on the Web. 

Second, the use of Markovian randomization in \24\ makes 
the analysis of convergence much more difficult. Clearly, the 
techniques for studying convergence of the i.i.d. randomized 
Kaczmarz algorithm are no longer applicable. The same com¬ 
ment holds for random coordinate descent algorithms m- 
In fact, the proof of convergence of Theorem 1 is based on 
sophisticated technical results in the theory of stochastically 
time-varying systems /HI- More precisely, the key technical 
result is Lemma 3 stated below, which deals with convergence 
of transition matrices, and it is based on ergodicity properties 
of (j)-mixing processes. 

Third, contrary to the randomized Kaczmarz algorithm, 
the algorithms and the convergence results can be extended 
to temporal networks with time-varying links, which is a 
significant extension, subsequently provided in Section VI. 

Finally, we notice that, when applying the randomized Kacz¬ 
marz algorithm to the PageRank problem, the probability 
of choosing a node is proportional to the size of the regression 
vector, i.e., 

N 

¥{s{k) = t} = \myj2\m\^. ( 29 ) 

1=1 

This implies that the probability vector for randomization 
relies on the global information in the denominator of 1)29} . 
This is different from the randomized diffusion algorithm 
studied in this section, where the distribution of s{k) tends 
to be asymptotically uniform and is unknown to any node. 

The proof of Theorem [T] depends on a key technical result, 
which is stated in this section for completeness. The proof is 
given in the Appendix. 

Lemma 3 (Convergence of Transition Matrices). Let the tran¬ 
sition matrix be <i>(A; + 1, j) = (I — 1/N ■ j) 

for all j < k and $(fc, k) = I. Under Assumption^on strong 

connectivity, the randomized incremental algorithm ( |24p with 
transition probability matrix given in Definition enjoys the 
following properties: 

(a) There exists a positive M > 0 and p £ (0,1) such that 

mk,j)\\L2 < Mp^-fVk >j>0. (30) 

(b) There exists a sufficiently large kg > 0 and rj £ (0,1) 
such that with probability one 

if k-j > ko, (31) 


where [aij is the floor function, i.e. the largest integer 
not greater than a: £ M. 

(c) With probability one, it holds that 




< (X). 


(32) 


i=o 


Proof of Theorem 0 By @ and (j2T|, it is obvious that 
ys{k) = Let e(k) := x{k) — x, it follows from (|24]) 

that e(fc + l) = (I-l/N-Hj^^^Hsik))eik) = + 1, 0)e(0). 

The rest of proof is a trivial consequence of Lemma 0 ■ 

Next, we establish the convergence results of the ran¬ 
domized incremental algorithms ( [25] l and (27 1 with unknown 
network size. 


Theorem 2 (Convergence with Unknown Network Size). 
Under Assumption 0 on strong connectivity, the randomized 
incremental algorithms ( [27| ) with transition probability matrix 
given in Definition 0 enjoys the following properties: 

(a) (Almost sure convergence) limfe_>oo x(k) = x with 
probability one. 

(b) (LP convergence) limfc_>oo ||a^(^) ~ x\\lv = 0 for any 

p > 1. 

The effect of the stepsize a(k) on the randomized incre¬ 
mental algorithms is essentially the same as that of 1/A^ as 
formally stated in the next result. 

Lemma 4. Under Assumption 0 on strong connectivity, the 
stepsize with transition probability matrix given in Definition 
^satisfies the property 

lim a(k) = \/N. (33) 

k—foo 


Proof: Without loss of generality, we assume that s(0) has 
a uniform distribution over V. Then, s(k) is an ergodic and 
irreducible process. By the Ergodic Theorem p9) , it follows 
that 

lim a(k) = l/N, (34) 

fc—>-oo 

where the equality is due to the uniform distribution of s(j) 
for all j > 0. ■ 

Remark 8 (Exponential Convergence). By Theorem 0 once 
a{k) is close to 1/N, the convergence becomes exponential. 

Remark 9 (Estimation of the Network Size). Clearly, the 
network size N is a global information and is unknown to 
an individual node. Distributed estimation of the network size 
has been addressed in e.g. / [49^ using various techniques. In 
fact, the diffusion algorithm of this section can be simply used 
to locally estimate the network size by using the reverse of the 
stepsize, i.e., limfe_>.oo a~^(k) = N with probability one. 


C. Relations to the State-of-the-art 


In 1241, p8) , the PageRank problem is solved by designing 
the so-called distributed link matrices. Specihcally, every node 
i is associated to a link matrix Ai, whose i-th column coincides 
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with the z-th column of W of this paper. The distributed update 
scheme with randomization is of the form 

TTL 

y{k + l) = {1 - m)As(k)y{k) + — ■ 1, (35) 


x{k + 1) = 


TT='<‘’ + '> + 


(36) 


where the initial condition y(0) = a:(0) is chosen as any 
probability vector. Under the assumption that s(fc) is an i.i.d. 
process with a uniform distribution and m G (0,1), it is proved 
that limt_j.oo E[||x(f) —x|p] = 0 with a linear convergence rate 
in 1281 and limt_>.oo a:(f) = x almost surely in | [50| , pT) . 

In comparison, the algorithms of this paper are derived 
via an optimization approach, which incrementally improves 
the total cost under Markovian randomization. While the 
PageRank algorithm in ( |36| ) is motivated from the distributed 
implementation viewpoint, its convergence proof is much more 
involved even for i.i.d. randomization. For instance, almost 
sure convergence in depends on a stochastic approxima¬ 
tion algorithm with expanding truncation, while our algorithm 
converges in the sense of for any p > 0. 

From the implementation point of view, the computation 


of (351 requires knowledge of the network size N, and the 
initialization ?/(0) should be a probability vector, which is 
not easy to obtain with unknown network size. Finally, the 
PageRank problem in temporal networks with time-varying 
links can be addressed using the techniques provided in this 
paper, as discussed in the next section. 

VI. The PageRank Problem for Temporal 
Networks with Time-Varying Links 

In this section, we generalize the randomized incremental 
algorithms to the PageRank problem of time-varying graphs. 
In comparison with static graphs, we cannot simply replace 
Hs(k) by Ffs(fc) in <241 due to the causality constraints, which 


results in the unavailability of H at time k. By encoding each 
node with a processor to record its hyperlinks, a natural way 
to attack this problem is to use its estimate. Thus, we obtain 
the following revised incremental algorithm 

x{k -f 1) = x{k) + a{k)Hs(k){kl)^{ys(k) - Hs(k){k)x{k)), 
_ _ (37) 

where H{k) is the estimate of H at time fc, and is recursively 
computed by 


H{k) = 


Q 


1 + • • • -f 


-Hit) 


= H{k-l) + 


1 + • • • + p' 


■[W{k)-H{k-l)]. 


By (10 1 , it is clear that limfe_>oo 7T(fc) = H. Note that the 


algorithm in ( [37] i can be distributedly implemented as in (27 1 
with the same approach previously explained. 

For temporal networks with time-varying links, another 
problem is how to appropriately dehne the transition prob¬ 
ability matrix of the Markovian process s{k). Obviously, the 
transition probability matrix, which characterizes the random 
surfer browsing behavior, needs to be adapted to temporal 


networks. Following the approach used for static graphs, the 
Dehnition [T] is revised as follows. 

Definition 2. [Transition Probability Matrix for Time-Varying 
Graphs] The incremental process s{k) is a Markov chain with 
a transition probability matrix P{k) = (1 —^11^ 
where uj G [0,1] and p^(^) is given by 

( if{i,j)G£{k), 

pW{k)^\ Wik) . 

y I 2 ^{i,u)e£{k) ^iu I V — J, 

I 0 otherwise. 

Similar to Definition F’(fc) exploits the behavior of a 
random surfer when browsing webpages at time k in time- 
varying graphs, and the convex combination parameter oj G 
[0,1] represents the two modes of randomization (centralized 
and distributed) previously described. Flowever, the transition 
probability matrix is constant for fixed graphs, which implies 
that s{k) is a homogenous Markov chain. This fact does not 
hold for temporal networks. In this case, we are dealing with 
a time-heterogeneous Markov process, which usually requires 
a much more involved analysis than the time-homogeneous 
one. However, Lemmas and still hold and the convergence 
results can be proved under the following assumption. 

Assumption 2 (Joint Strong Connectivity). For the distributed 
randomization, i.e., oj — 0, we assume that there exists an 
integer Q > 1 such that the joint graph (V, (Z)) of 

the webpages is strongly connected for all k. 

The joint connectivity assumption ensures that every pair 
of pages are reachable from each other in finite time. We 
now state the main convergence result of the randomized 
incremental algorithm ( [24| for temporal networks. 

Theorem 3 (Convergence for Temporal Networks). Under 
Assumption on joint strong connectivity, the randomized 
incremental algorithm \24\ with a transition probability matrix 
given in Definition enjoys the following properties: 

(a) (Almost sure convergence) limfe_>.oo x{k) = with 
probability one. 

(b) (L^ convergence) limfc_>.oo \\x{k) — x 0 for any 

p > 0. 

Proof: The proof is similar to that of Theorem [T] but we 
need to re-elaborate Lemma for temporal networks. 

Under Assumption it follows from Lemma 1 in | |5^ that 

k+Q-l 

P{k)= J] P{1) 

l=k 

is ergodic for any fc C N. Since the network size of webpages 
is assumed to be fixed, P{1) can only take a finite number 
of values, i.e., P{1) G V where 7^ is a finite set containing 
all possible values of P{1). This implies that P{k) takes 
a finite number of values as well. By Theorem 3 in | [52) , 
we further obtain that the product of any finite number of 
P{k) is ergodic, i.e., ergodic as well for any 

finite TO. Together with Theorem 2 in | [5^ and the double 
stochasticity of P{k), it follows that the distribution of s{k) 
converges exponentially to a uniform distribution over the 
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Fig. 3. Random graph and its centralities. 


Fig. 4. Stepsize. 


set V. This implies that Lemmas and will hold under 
Definition and Assumption Then, we can easily establish 
the convergence for the incremental algorithm ([37|l as for static 
graphs. Specihcally, let 

C(fc) = a{k)Hs(^k){k)'^{ys{k) - H^(k){k)x{k)). 

This implies that 

e{k +!) = (/- a{k) ■ s(k))Hk) + ({k), (38) 

where e{k) = x{k) — x*®. Similarly, it can be shown that e(k) 
is uniformly bounded. Together with ( [TOl l and Lemma it 
follows that limfe_>.oo C(^) = 0- Finally, the proof is completed 
by using Corollary [T] in the Appendix. ■ 



Node Index 


Fig. 5. Scale-free network and its centralities. 


C. PageRank Computation Over Real Web Data 



VII. Simulations 

In this section, we report simulation results on the dis¬ 
tributed computation of betweenness centrality over a directed 
tree, and the degree, closeness centralities and PageRank over 
a randomly generated graph. 

A. Degree, Closeness, and PageRank Computation 

We consider two types of graph models; random graph 
model and scale-free network |[TJ, both of which are undirected 
with 50 nodes. The connections between two nodes of the 
random graph are denoted by a dot, and the probability of a 
connection is selected as one half, see Fig. For this graph, 
the normalized degree, closeness and eigenvector (PageRank) 
centralities are shown in Fig. In the PageRank computation, 
we test the distributed algorithm in ( [27] i on this randomly 
generated graph. As shown in Fig.j^ the inverse of the stepsize 
a{k) of each node indeed converges to the network size. That 
is, the network size can be distributedly estimated by each 
node. Similar results can be observed in the scale-free network, 
see Fig. 

B. Betweenness Computation Over a Directed Tree 

We perform the distributed computation of betweenness 
centrality via a directed tree in Fig. The normalized be¬ 
tweenness centralities are obtained in a finite time and are 
illustrated in Fig. where node 8 has the largest betweenness 
centrality. It is consistent with our intuition that this node 
controls the largest number of communications between other 
nodes. 


The Web data utilized in the simulations has been obtained 
from the database 0 collected by crawling Web pages of 
various universities. This database has been previously used as 
a benchmark for testing PageRank algorithms p4) . We have 
selected the data from Lincoln University in New Zealand 
for the year 2006. This web has 3,756 nodes and 31,718 
links corresponding to 684 domains. The largest is the main 
domain of the university (www.lincoln.ac.nz), consisting of 
2,467 pages. Other larger domains in this dataset contained, 
for example, 221, 101, 68, 24 pages. In this Web, a fairly large 
portion of the nodes are dangling nodes, e.g., there are 3,255 
dangling nodes (more than 85% of the total). Furthermore, two 
nodes with no incoming links have been removed because their 
effects on the PageRank values are negligible. The pages were 
indexed according to the domain/directory names in alphabetic 
order. 

To proceed with the PageRank computation, the Web was 
modified to obtain a column stochastic matrix. This modifi- 



Fig. 6. Directed tree and the betweenness centralities. 





























12 



Fig. 7. The PageRank of Web data. 


cation was done adding back links to dangling nodes. The 
resulting Web had 40,646 links. For this Web, the PageRank 
values were calculated (for comparison purposes) by the 
power method. About 40 iterations were sufficient for its 
convergence. We also implemented the PageRank Algorithm 2 
in ( |Z7| ) with unknown Web size. The results are shown in 
Fig. I^and are similar to those obtained in pT] . In particular, 
we observe that the pages with higher PageRank values are 
those corresponding to two clusters (with page indices around 
500 and 2,500) where many pages are linked to each other. 
The top two pages of PageRank values are in fact the “search” 
pages of the Lincoln University while the main website of the 
university is ranked third. 

Convergence of the algorithm is guaranteed by Theorem 
The algorithm basically has exponential convergence (Remark 
8 ), contrary to the algorithms proposed in | |24) which are 
based on time-averaging and have linear convergence rate. 
Furthermore, the algorithm does not require a common clock 
(Remark 3) and the Webpage randomization may be of 
Markovian-type instead of i.i.d., see Section V and Remark 7 
in particular. 


VIII. Conclusion 

In this paper, we have studied the distributed computation 
for the degree, closeness, betweenness centrality measures and 
PageRank. In particular, we proposed deterministic algorithms 
which converge in finite time for the degree, closeness, and 
betweenness centrality measures. For the PageRank problem, 
a randomized algorithm was devised to incrementally compute 
the PageRank. Different from the existing literature, this 
algorithm does not require to know the network size, which 
is typically difficult to obtain in a distributed way, and can be 
asymptotically estimated by each node. Extensive simulations 
using a classical benchmark were included to illustrate the 
theoretical results. Future work will be focused on extensions 
of this approach to study other important applications in net¬ 
works, including clock synchronization in wireless networks 
m and opinion dynamics in social networks | [5^ . 

Appendix 

By using Definition 1 and Assumption 1, the random 
process s{k) exponentially converges to a uniform stationary 
distribution over the set V. Without loss of generality, in this 
Appendix, we assume that the Markov process s{k) already 
starts from the uniform distribution. 


A. Proof of Lemma 

Before giving the proof, let Sp{X) be a set of random 
sequences X = {X/.} with A G [0,1) and p > 1 as 


Sp(A) := 


X: nn^-^. 


LP 




< MA 



(39) 


for some M > 0, and for all A: > i > 0. 

Proof: (a) Under Assumption it is easy to verify that 
s{k) is an ergodic and irreducible process. In addition, s{k) is 
stationary. This immediately implies that it is also a p-mixing 
process lig.By Q and ( pT| l, it follows that 

H^Hf < aef + (1 - rnfW.Wl 

N 

= 1 -f (1 - m)2 ^ wfj 

i=i 

< I +J 2 

< N, (40) 


which implies that 0 < 1/W • ^ I- 

Let Xk be the a-algebra generated by events associated 
to random variables {s(0 ),..., s(fc)} and Aniin(3f) be the 
minimum eigenvalue of a positive semi-definite matrix X. 
Since s{k) has a uniform distribution over V, we obtain that 

= (41) 

where H is defined in ( |28] l. 

By Lemmaj^ all the eigenvalues of are 

strictly positive. In light of Theorem 2.3 in p^ , there exists 
a > 0 and A S (0,1) such that {Xk} G 5i(A) where 


Xk ■— An 


E 


{h- 


(k+l)h 

l)iV ^ 

' i—kh-\-l 


^s(i)^s{i) iJ'kh 


and 5i(A) is defined in (39 1 . Together with Theorem 2.1 in 
1 , it follows that 


^ £ Mn and o = 

That is, there exists a positive M > 0 and p = A^“ G (0,1) 
satisfying Q, which completes the proof of part (a). 

(b) For ease of notation, let t = Given any 

sufficiently large k, it follows from ||a4i3|| < ||A||||i7|| and 
||$(fc, j -I- tN)\\ < 1 that 


t 

||^(fc,7)|| < n + + 1)W|| 

7=1 


which immediately implies that 
1 1 ^, 

-•log||$(fc,j)|| < -y^log||$(j-fLW,j-f(i-l)W)||. (42) 

^ 7 = 1 

Since s{k) is ergodic and stationary, it follows from the 
Ergodic Theorem | |58) that 

1 , ^, 

lim -y^log||$(j-fLiV,j-f(i-l)X)|| = E[log ||$(iV, 0)||] 

t—^oo t 

( 43 ) 
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with probability one. 

It is clear from Lemma S that % is positive debnite. Then 
span{i7f,..., = K , i.e., {Hi ,..., Hjj} generates 

Let II $ 0 1! = II nti(^ - 1/^" ■ Jointly with 

Lemma 3.52 in |j^, we obtain that ||<i>o|| < 1- By the 
ergodicity and stationarity of s{k), it follows that 

po := P{s(l) = l,...,s(iV) = iV}>0. 


As in part (b) of Lemma there exist sufficiently large kz > 
k 2 ,ji > k 2 and pi G (0,1) such that 

j)|| < if fc - j > fto and j > ji. (48) 

Similar to part (c) of Lemma there exists a positive Mj^ 
such that 


Note that ||<i)(At, 0)|| < 1 under any realizations of 

s(l),..., s{N), we obtain that 


E[log||$(7V,0)||] ^pologll^oll < 0. 


(44) 


For any e S (0, —po log ||‘ho||), it follows from (43 1 that 
there exists a sufficiently large to > 0 and t > to such that 

1 ■ * 

y + + ^Folog||$oli+e- (45) 

^ i=l 

Let ry = exp(po log H^oH + e) < 1 and fcg = toN, it follows 
from (42 1 that ||$(fc,j)|| < ry* if fc — j > ko- 

(c) For any sufficiently large k, let t be the smallest integer 
such that t > k/k\ — l, i.e. t = \k/ki — l\, where ki = fco + 1. 
This implies that k — tki < ki. Noting that ||<i)(A;, j)|| < 1, it 
follows from part (b) that 

k k — tki t 


3=0 


< 


< 


3=0 

t 

fci + fci E ^7 

3 = 1 

1 


i=i 


L^J 




1 + 


l-7y 


That is, j)ll 1® uniformly upper bounded by 

^i(l + T^)’ which completes the proof. ■ 


B. Proof of Theorem 

In light of Lemmas |3]|^ we have the following result, which 
is central to the proof of Theorem 

Corollary 1. Let the transition matrix be $(fc + l,_y) = 
(/- aik) ■ j), j < k and <^{k,k) = I. With 

probability one, it holds that 


k 

lim E ll^(^>i)ll < oo- (46) 

k—^oo 

J^O 


Proof: We hrst note from (40i that there exists a suffi¬ 
ciently small e > 0 such that HjHf + e < A^. In particular. 


N 

Q<e<N—I — — to)^ miniy^ wl:}- 

igV ^ ^ 

3 = 1 

By Lemma 1^ there exists a sufficiently large ^2 such that 

^<a(/c)< ^,Vfc>fc2. (47) 

Combining the above relations, we obtain that 

0 < a{k)Hjf^k)Hs(k) <I,yk> k2. 


k 

E mk,j)\\<Mi„\fk>ji. (49) 


For all A: > ji, we obtain that 

Eii5(a:,j)ii = Eii$(fc,j)ii+ E 

3=0 3=0 3=31 + 1 

< EP(ji,j)ii+ E 

3=0 3=31 + 1 

< ji + 1 + Mj ^ , (50) 

where we used the fact that ||$(fc, j)|| < 1 for all k > j. ■ 
Proof of Theorem m (a) By ^ and ( [2T] ), it is obvious 
that 2 /g(fc) = Let e{k) := xik) - x and y„(k) ■= 

ys{k) — m/N, it follows from (25i that 

e(A:-fl) = {I-a{k)-Hj^k)Hs{k))e{k)+a{k)Hj'^k)ys{k), (51) 

which can also be written as 


e{k + l) = $(fc-fl,0)e(0)-|-E4’(^,j)a(j)Lfs^j)2/s(i)- (52) 


3=0 


By (48 I, it is obvious that the hrst term of the right hand 
side converges to zero with probability one. 

By Lemma 1^ it follows that lim/j_>.oo a(fc) = 1/7V and 
limfc_>.oo ys{k) = 0 with probability one. Jointly with Corollary 
[2 it follows from Toeplitz’s Lemma | |58) that the second term 
in the right hand side of (|52)l converges to zero with probability 


(b) By the hrst part, we know that sup/^gp^ ||e(fc)|| < oo. 
Together with the Dominated Convergence Theorem | [58| , it 
follows that 


lim ||x(fc) —a;| 


iP = E[ lim I 

k—foo 


lip 

(fc)-x||g) =0,(53) 


which completes the proof. 
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